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Abstract 


We investigate exact indexing for high dimensional l p norms based on the 1-Lipschitz property 
and projection operators. The orthogonal projection that satisfies the 1-Lipschitz property for the l p 
norm is described. The adaptive projection defined by the first principal component is introduced. 

Keywords'. 1-Lipschitz property; curse of dimensionality; nearest neighbour; high dimensional indexing; 
l p normCognition 

1 Introduction 

While there are relatively efficient approximate similarity search algorithms, it is widely supposed that 
the exact search suffers from dimensionality IfTTIl . Thus, solving the problem in the most general case 
for an arbitrary dataset is impossible. We investigate exact indexing for a vector space V and a distance 
function d. Exact indexing is based on exact similarity search, and no data points are lost during range 
queries. For a range query vector y from a collection of s vectors, 


Xi,X 2 ,X 3 ,--- ,x s 


all vectors x, that are £-similar according to the distance function d are searched 


dfa, y) < £. 


( 1 ) 
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In approximate indexing, the data points that may be lost as some distances are distorted. Approximate 
indexing 00, (71 seems to be in some sense free from the curse of dimensionality, (Til . Distance-based 
exact indexing is based on the 1-Lipschitz property ifTTil . A mapping function F() maps two vectors x 
and y into a lower dimensional space, where d is a metric in the original space and df eatU re is a metric in 
the feature space that satisfies the 1-Lipschitz property 

dfeature{F(x),F(y)) < d(x, j). (2) 

This equation is also known as the lower bounding postulate 0, |ffl. Using the 1-Lipschitz property, a 
bound that is valid in both spaces can be determined. The distance from similar vectors to a query vector 
y is smaller or equal in the original space and, consequently, is smaller or equal in the lower dimensional 
space as well. During the computation, all the points below the bound arc discarded. In the second step, 
the wrong candidates are filtered by comparisons in the original space. The application of the 1-Lipschitz 
property as used in metric trees and pivot tables does not resolve the curse of dimensionality, as shown in 
ifTOI . For high-dimensional spaces, the functions that obey the 1-Lipschitz property discard fewer points 
as the number of dimensions grows (Toil . The number of points discarded drops as fast as the number of 
dimensions grows. As stated in fill , every 1-Lipschitz function concentrates sharply near its mean (or 
median) value, which results from the fact that a sphere with a constant radius increases exponentially 
with growing dimensions. A linear radial increase leads to an exponential increase of points inside the 
sphere 0, iflTI . which leads to a degradation of the method’s performance. This situation leads to 
the “curse of dimensionality”, which states that for an exact nearest neighbor, any algorithm for high 
dimension d and n objects must either use an ^-dimension space or have a query time of n xd (3, lHHl - 

However, ifTZl . |[T3l . and llT4l show how the recursive application of the 1-Lipschitz property can be 
used to overcome the curse of dimensionality for certain cases of points equally distributed by subspace 
trees. A high-dimensional space is divided into low-dimensional sub-spaces lfl2l . MT3ll . In the low¬ 
dimensional sub-spaces, 1-Lipschitz functions can be successfully applied. The main contributions of 
this paper arc as follows: 

• Introduction of a new adaptive projection. The optimal projection is not fixed by orthogonal 
projection but learned. 

• Extension of the technique beyond the Euclidean norm (L). Many applications rely on the l\ 
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norm. It is shown that ]\ norm gives better results than h norm. 


• Simplification of the mathematical framework. 

The paper is organized as follows: 

• We review the projection operators. 

• We introduce the adaptive projection and the l p norm dependency. 

• The adaptive projection and the l p norm dependency are integrated into the subspace tree. 

• We empirically compare the adaptive projection with the orthogonal mapping. We empirically 
compare the h,h, h and L, norms. 

2 Projection Operators 

Ideally, the mapping function F() should preserve the exact distances 0], 0. An example of such a 
function for real vectors is a norm preserving linear operator Q. Such an operator can be represented 
by an orthogonal matrix with Q T = Q 1 performing a rotation or a reflection. An example of such an 
operator is the Karhunen-Loeve transform, which rotates the coordinate system in such a way that the 
new covariance matrix will be diagonal, resulting in each dimension being uncorrelated. A mapping 
that reduces the dimensionality of a vector space can be represented by a projection operator in a Hilbert 
space, which extends the two or three dimensional Euclidean space to spaces with any finite or infinite 
number of dimensions. In such a space, the Euclidean norm is induced by the inner product 


= v / (4x>- 


(3) 


x 


If W is a subspace of V, then the orthogonal complement of W is also a subspace of V. The orthogonal 
complement W L is the set of vectors 


W ± = {yGE|(y|x)=0 x e V} 


(4) 


and 


v = wew 1 -. 


(5) 
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Each vector x E V can be represented as x = xw + x w with Xu/ G W and x w ± E W 1 . The mapping 
P ■ x = xw is an orthogonal projection. Such a projection is always a linear transformation and can be 
represented by a projection matrix P. The matrix is self-adjoint with P = P 2 . An orthogonal projection 
can never increase a norm 


|P-x || 2 = ||x w || 2 < ||x w || 2 + ||x w _l || 2 = ||x w + x w _l || 2 = 


( 6 ) 


Using the triangle inequality 


x + y|| < ||x|| + ||y 


setting 


||y + (x-y)|| < ||y|| + ||x-y 


the tighten triangle inequality 


|x|| - ||y|| < ||x —y| 


(7) 

( 8 ) 

(9) 


follows. From the fact that the orthogonal projection can never increase the norm and the tightened 
triangle inequality, any orthogonal projection operator has the 1-Lipschitz property 


||/ , -x||-|| J P-y|||< ||P-x —P-y|| = H-P- (x y)|| < ||x-y|| 


( 10 ) 


It follows that any projection satisfies the 1-Lipschitz property, which means that the lower bounding 
postulate Q,|01 and any orthogonal projection are satisfied. For example, the “Quadratic Distance 
Bounding” theorem is satisfied 0. There is no the need for a more complicated proof based upon the 
unconstrained minimization problem using Lagrange multipliers 0. 

2.1 Projection onto one-dimensional subspace 

For ||p|| = 1, p • p T is an orthogonal projection onto a one-dimensional space generated by p. For 
example for the vector 

P= f—7=,— ,-j=) (U) 

V V n V n V n J 
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the orthogonal projection from R n onto one-dimensional space R is 


P = P P = 


/ 1 

n 


1 1 


\ 1 1 

\ n n 


n / 


( 12 ) 


For f <n orthogonal subspaces 

R” = £j©£ 2 © ...©£> (13) 

of the vector space R n we can define a projection P : R" >->• Rf as a sum of / projections onto one 
dimensional space 

P = V\ •Pl' + Pl'Pj ( 14 ) 

with p, p/ : E; i—> R and ||p ( || = 1. The 1-Lipschitz property of the projection from the subspace E, the 
one dimensional space R is 


IP/ * P/" ■ *11 


llPrP^-yl 


< |||x|| - ||y||| < ||x — y|| 


(15) 


The projection P, represented by Equation [141 should distort the distances between the vector space R n 
and R* as little as possible. As a consequence, the distortion for each subspace E, should be minimized. 
Because of the 1-Lipschitz property for the one-dimensional space, according to the Equation [15] we 
need to minimize the distance in the one-dimensional space between the length of the vector and the 
length of its projected counterpart 


P, Pi ' x 



(16) 


Suppose the dimensionality of the subspace Ej is m. We define the vector a as 


a = PrPj ' x 


(17) 


It follows that 


a = \Jm-a = ||a|| = ||p, -p^ -x| 


(18) 
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and with 


a i = Cl2 = ■■■ = Clk = ■■■ = = & 


a — (r?i; 0-21 ■ -Ojtj ■ flm)• 

With a being the length of the projected vector we preform the following operation 

min{|a— ||x|||}. 

From the tighten triangle inequality, it follows that 

minja— ||x||} < min{||a —x||} 

according to the Euclidean distance function. To minimize the Euclidean metric ||a 
choose the value of a Ifl4ll ? It follows that 

min {x\ - a) 2 + (xo - a) 2 +... + (x m - a) 2 ^ 


n _ dd(x,a) _ m-a- (L”ii *i) 

da v //wa 2 + i;i 1 Jt?-2-a-(ir= 1 ^) 

with the solution 



m 


which is the mean value of the vector x. It follows 


a = 




m 


PrPrx 


(19) 

( 20 ) 

( 21 ) 

x||, how do we 

( 22 ) 

(23) 

(24) 

(25) 
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with the corresponding projection matrix Pj 


( 1 

m 

1 


Pi = Vi vi = 


j_ 

m 


\ 1 1 

\ m m 


m 


- i 

m / 


Pj is generated by the normalised vector p, 


1 1 

m' x/m' 


( 26 ) 


(27) 


which indicates the direction of the m-secting line, which is a continuous map from a one-dimensional 
space to an m-dimensional space given by 


X\ = X\ 
X 2 — X\ 
X 3 =X\ 


(28) 


x m — X 1 


For m = 2, this equation is the bisecting line with x\ = X 2 or, represented as a curve, 


X\ = Xl 
X2 = X\ 


(29) 


which, for uncorrelated data Pj, is the best projection onto one dimension, as indicated in next section. 
The projection can be computed efficiently without needing matrix operations as the mean value of the 
vector multiplied with the square root of its dimensionality. 


m • 


ir=t *,• 


m 


(sk 

m 

Elk*, 


-1 

m 


m 


m 


_j_ 

m 


\ 1 1 

\ m m 


m 


1 

m 


\ 

(*') 



X 2 


/ 

^ Xm j 



(30) 
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A projection P : R n h-» R f given the decomposition into / orthogonal spaces according to Equation |T3] is 
composed of a sum of / projections onto a one-dimensional space. Each projection is a projection on 
an m-secting line with P, : R'" R. The method works with the space split in any way. For simplicity, 
we assume that the //-dimensional space is split into / equal-dimensional subspaces. In this case, the 
projections are efficiently computed as the mean value of each sub-vector. The corresponding mean 
values are multiplied with the constant c = s/m = ^J~^. The selection of the division can be determined 
by empirical experiments in which we relate m to n with the constraint that n is divisible by m. 

2.2 The First Principal Component 

The covariance matrix represents the tendency of two dimensions varying in the same direction as indi¬ 
cated by the data points. The Karhunen-Loeve transform rotates the coordinate system in such a way that 
the new covariance matrix will be diagonal. Therefore, each dimension will be uncorrelated. The trans¬ 
formation is described by an orthonormal matrix, which is composed of the normalized eigenvectors of 
the covariance matrix. The squares of the eigenvalues represent the variances along the eigenvectors. 
The first principal component corresponds to the normalized eigenvector z with the highest variance. 

||z|| = 1 with Z = z - z T is the best projection onto one-dimensional space because, in a Hillbert 
space, the first principal component passes through the mean and minimizes the sum of squares of the 
distances of the points from the line. It follows that 

||x|| > ||P-x|| > ||Z-x||. (31) 

For uncorrelated data, Z = P represents the projection on the m-secting line. For correlated data contrary 
to the projection on the m-secting line, all the components of the vector z do not need to be equal, and the 
projection cannot be computed efficiently. For a vector o of length s/m in the direction of the m-secting 
line where P is the projection on the m-secting line, 


o = (1,1,1,---,1) 


( 32 ) 


it follows that 




The value 


y/m— ||Z - o|| (34) 

indicates the diversion from the m-secting line with value 0 corresponding to uncorrelated data and 
sjm — 1 one dimension data with 


sfm—l>y/m—\\Z-o\\>0. (35) 

For a given decomposition into / orthogonal spaces according to the Equation [13] the data points are 
mapped into corresponding subspaces E\. For each subspace £) the covariance matrix C, is computed. 
In the next step for each covariance matrix C, the first principal component with the highest variance is 
determined. It is represented it by the normalised eigenvector zEach projection 

Zi = z/ • zj (36) 

is a projection onto the first principal component with Z, : R m R. An adaptive projection A : R" i->- R f , 
given the decomposition into / orthogonal spaces according to the Equation [13] is composed of a sum 
of / projections Z, onto a one-dimensional space. 

A = z\-z]+ z 2 -zj ... + Zf-zJ. (37) 

The method works under any splitting of the space, such as the projection P : R n t-p- R f . 

2.3 l p norm dependency 

Some applications require distance functions that differ from the Euclidian distance function. In addition 
to the Euclidean distance function, the Manhattan distance and the Chebyshev distance function are 
commonly used. In the following, we generalize the Euclidean norm to the l p norm that induces a 
corresponding metric. The l p norm is defined as the following (for p = 2 it is the Euclidean norm): 

ll x llp = (|m| p + M p H- \-\x,n\ p )~ p (38) 
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Ip norms arc equivalent and the following relation holds for 0 < q < p 

i_i 

IWIp < \\x\\ q <mi p ■ ||x|| p (39) 

and 

i_i 

mp « ■ ||x|| 9 < \\x\\ p < ||x|| 9 . 

The tighten triangle inequality is valid in any l p norm due to the definition of norm 
norms arc equivalent the following equation is valid as well for any l p norm 

\\ p - x \\ p - P'-ylll, < ll^-x-^-yllp = ll^-( x -y)llp < ll x ~y||p (41) 


(40) 

. Because the l p 


and 


||Z-x|| p < 


|F-x|| p < 


ip- 


(42) 


The lineal - projection operator P has the 1-Lipschitz property in any l p norm and 


- ir=t *«• 

m” ■ - 

m 


( VLiIl \ 

m 

Eti * 

m 




j_ 

m 


J_ 

m 


m 


\ 

M 



*2 


/ 

^ j 



(43) 


The projection P can be computed efficiently without needing a matrix operation as the mean value of 

i 

the vector multiplied with the constant c = mp. For the dimension m, for the l\ norm, c = m, for the P 
norm, c = s/m, and for the norm, c = 1. A lower l p norm corresponds to a higher constant m>c> 1 
and less information loss. We cannot gain any advantage of the 1-Lipschitz property using the different 
l p norms. The behavior of the constant c is related to the equivalence of the norms relation. For example, 
the /1 and h relation is 

||x||2 < ||x||t < \fm- ||x||2. (44) 


For ||q|| p = 1 with Q = q T • q is a mapping onto one dimensional space generated by q. It is not a 
projection for p > 2 because the matrix is not self-adjoint with Q = Q 1 . The mapping on the m-secting 
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line, the operator can be understood as 


Q = q q = 


~~T 

~~T 

mP 

mP 

1 

1 

2 

2 

m P 

mP 

1 

1 


mP mP 


X \ 

mP 

1 

2 

mP 


(45) 


Q is generated by the l„ normalized vector q indicating the direction of the m-secting line. 


<T 


_ 1 1 1 

Q — I t > t »■ ■ ■) r 

. m p m p m p 


(46) 


The mapping can be computed efficiently without requiring matrix operations as the mean value of the 


p- i 


vector multiplied with the constant d = m p 

( \ 


^4 L,_ l x i 

m p ■ - 

m 


2 
mP 

Lti * 

— T~ 

mP 


V ) 


1 

1 

2 

2 

mP 

mP 

1 

1 

m~P 

m~P 

1 

1 

2 

2 

mP 

mP 


1 

2 

mP 

1 

m~P 


1 

2 

mP 


\ 

) 



X 2 


/ 

^ Xm j 



(47) 


However this mapping can increase a norm. For the norm l p the induced matrix norm is 


WQWp =max||0-x|| p 
\\x-p\ 


(48) 


and for x = q 


WQWp 


P-2 

= m p . 


It follows that for p > 2 


WQ-Ap > II x IIp 


(49) 


(50) 


the norm is increased. Only for p <2 the norm is not increased with h the projection P and l\ the simple 
mean value. 
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3 Subspace tree revisited 


An adaptive projection, A : R n i->- R f , maps two vectors, x and y, into a lower-dimensional space and 
satisfies the 1-Lipschitz property: 


||A-x-A-y|||p < ||x — y|| p . (51) 

Using the 1-Lipschitz property, a bound that is valid in both spaces can be determined. The distance 
of similar vectors to a query vector y is smaller or equal in the original space of dimensionality n, 
and, consequently, it is also smaller or equal in the lower-dimensional space of the dimensionality /. 
During the computation, all the points below the bound arc discarded. In the second step, the wrong 
candidates arc filtered by comparisons in the original space. The number of points discarded drops as 
fast as the relation between the dimensionalities j grows. Depending on the correlation between the 
dimensionalities, the 1-Lipschitz property is only useful if the relation is sufficiently small with 

j < d (52) 

where d varies between 2 < d < 16 in relation to the data set. However, high-dimensional indexing 
requires that the mapping F : R n R d with n 'F> d satisfies the 1-Lipschitz property. For such a function, 
only a tiny fraction of the points of a given set arc below the bound. Thus, the majority of the points 
have to be filtered by comparisons in the original space. Therefore, no speed up, compared tp the use of 
a simple list matching, can be achieved, as proclaimed by the conjecture “the curse of dimensionality”. 
If at least some points of a given set arc below the bound, there is a way to build a recursive function 
that achieves a considerable speed up using a simple list matching. Motivated by the divide and conquer 
principle and the free structure, one can build such a function recursively, indicating that the “the curse 
of dimensionality” conjecture is wrong for some data sets. It is well known that, for a dimensionality d 
(2 < d < 16), metric index frees operate efficiently. Thus, in the next step we define an efficient indexing 
structure that builds on the mapping F : R" R d that satisfies the 1-Lipschitz property, with F being a 
projection or an adaptive projection. 

Suppose there exist a sequence of subspaces Uq, U\ , U 2 , ■ ■ ■ , U t with R" = Uq and R d = U, in which 
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each subspace is a subspace of another space 


U 0 D Ui D U 2 D • • • D U t 


(53) 


and with dim(Ui ) indicating the dimension of the subspace Ui 

dim(Uo ) > dim(U\) > dim(U 2 ) ■■■> dim(U t ) 

and the relation between neighbouring subspaces is sufficiently small with 

dim(Uo) ^ dim(Ui ) ^ dim(U t -\) ^ 

dim(Ui) ~ ’ <Zwi(t/ 2 ) “ ’" ’ dim(U t )~ 1 j 

We define a family of projections for the sequence of subspaces (either adaptive or not) with the follow¬ 
ing 

A\\Uq^ Ui\A2 : U\ •->- C/ 2 ; U t -\ i->- I7 f . (55) 

The family of projections defines the sequence of subspaces. Given a bound e and a query vector y for 
each subspace including the original space Uq, certain points are below the bound. For each subspace 
Ui, the number of points below the bound e is indicated by the value a,. It follows that 


a 0 < CTi < ... < a, < ^ (56) 

where s is the size of the data set. The resulting computing cost given a bound e and a query vector y is 

t 

cost s = 2^ ' dim(Ui-i ) + s ■ dim(U t ). (57) 

i=i 

The cost of list matching is 

cost / = s ■ dim(Uo) (58) 

The saving cost s < cost) is related to the bound e. Empirical experiments suggest that cost s -C costi for 
a bound with Oo < d. 

The described projection based method cannot be applied to sparse representation, as present in the 
vector space model [T|- 


13 





3.1 Tree isomorphy 


The isomorphy to a tree results from the assumption that the value of a, is reciprocal to the preceding 
dimensionality lfl3l . Therefore, a bigger dimensionality dim(Ui+i ) results in a smaller value <7„ and vice 
versa. We can express this relation by 


const ■ a, = ————-. (59) 

dim(Ui + 1 ) 

The value of const is dependent on the data set and its norm. The value of a, is reciprocal to the 
preceding dimensionality dim(Ui + \) (Equation [59]), and the computing costs are expressed by 


cost s 


/ dim(U 0 ) dim(U\) dim(U„- 1 ) \ 

\dim(Ui) dim(U 2 ) dim(U t ) / 


+ dim(U t ) ■ s. 


(60) 


Supposing d = dim(U t ) and n = dim(Uo) 


costs ~ 1 /const ■ d ■ log^ [n — d) + d ■ s. 


(61) 


For a dimension d, the metric index trees operate efficiently with a nearly logarithmic search time. For 
the bound with Go < d, the value 1 /const <C s 


costs ~ 1 /const - d ■ (log d (n) — 1) + d-log rf (5). 


(62) 


It follows that the lower bound of the computational cost is 


Q(log(n)+log(s)). 


(63) 


4 Examples of £ similarity 


The £ range queries depends on the adequate value of £. A method for the estimation of such a value is 
described in fl2l . Fet DB be a database of s multimedia objects x-' J represented by vectors of dimen¬ 
sionality n in which the index i is an explicit key identifying each object 




( 64 ) 
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For a query object y, all objects x (,) are searched that are £-similar 


l|x (0 -y|| P < £■ (65) 

For high dimensional vector space such a set can be determined by a list matching over the whole data 
set. 


4.1 Computational procedure 

For the database DB that is projected into the subspace f4, 


{u k (x)M eu k (DB)\ie{i..s}}. 


( 66 ) 


The algorithm to determine all £-similar objects is composed of two loops. The first loop iterates over the 
elements of the database DB, and the second iterates over their representatioij* . We can easily parallelize 
the algorithm over the first loop; different parts of the database can be processed by different processors, 
kernels, or computers. 


Algorithm to determine NN 

forall (x6) e DB\i € {1.-5-}} 

{ 

for(k = t\k / 0,k -) 

{ 

load(U k (x)®); 

/* 1-Lipschitz property */ 

i/(||t/ t (x)»-t/ t (y))|| p >=£) 

break 

if (k = 0) print x il> is NN of y 

} 

} 


1 An implementation can be obtained upon request from the author 
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Each call of the 1-Lipschitz property costs dim(Uk)- The cost according to Equation[571 correspond to 
the number of 1-Lipschitz property calls, corresponding to the value O'/,. 

4.2 960-dimensional vector space 

We apply computational procedure on high-dimensional data set of 100000 vectors of dimensionality 
960. The vectors represent the GIST global descriptor of an image and are composed by concatenated 
orientation image histograms [[£)]. The vector x of dimensionality 960 is split into 480 distinct sub¬ 
vectors of dimensionality 2. The data points are described by 480 covariance matrices C, for each 
subspace. For all points, the covariance matrices are computed iteratively. 


X=*l,* 2 ,* 3 ,* 4 ,.,*479,*959 . ( 67 ) 

Ci C2 C480 

The resulting 480 projections, z, -zT, define the adaptive projection A : A' 960 i-)> f? 480 . We apply the 
adaptive projection and the determination of the adaptive projection recursively. The resulting family of 
projections, 

Ar-Uo^r UuA 2 : U\ i->-1/ 2 ; • • • ;Ay :U 6 ^U 7 (68) 

defines the dimensionalities of the subspaces. 


dim(Uo) = 960 > dim{U\) = 480 > dim(TJj) = 240 > dim(Uj,) = 120 


> dim{U/[) = 60 > dim(JJ$) = 30 > dim{JJ(,) = 10 > dim(Uj) = 5. 
In Table [I] we indicate the mean costs according to Equation [57] using the h norm. 


projection 

e for « 52 NN 

mean cost 

ratio 

orthogonal 

6300 

4584277 

21.38 

adaptive 

6300 

4393127 

22.31 


Table 1: Mean ratio of list matching to the mean computation costs according to Equation [57] The 
values were determined over a disjunct sample of S C DB with size |S| = 400. The adaptive projection 
gives only a slight improvement. The diversion from the m-sccting line according to Equation [34] is 

always <C 0.0001. 
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4.3 12288-dimensional vector space 


The 12288-dimensional vector space corresponds to an image database that consists of 9.876 3-band 
RGB (Red, Green, Blue) images of size 128 x 96. Each color is represented by 8 bits fl2l . Each of 
the tree bands of size 128 x 96 is tiled with rectangular windows W of size 4x4. The data points are 
described by 32 x 24 covariance matrices C, for each subspace, for each band. The resulting 768 = 
32 x 24 projections z,- • z J define the adaptive projection A : A' 122X8 i —> R 768 for each band (Red, Green, 
Blue). We apply the adaptive projection and the determination of the adaptive projection recursively. 
The resulting family of projections, 


Ar-Uo^ UvM : U i G 2 ;A 3 :U 2 ^U 3 


(69) 


defines the dimensionalities of the subspaces for each band 


dim(Uo) = 12288 > dim(U\) = 768 > dim(U 2 ) = 48 = 8 x 6 > dim(Uj) = 12 = 4 x 3. 


For an orthogonal projection, the sequence of subspaces Go D U\ D U 2 D G 3 corresponds to the “image 
pyramid” 0, (6), which has a base that contains an image with a high-resolution and an apex that is 
the low-resolution approximation of the image. In Table [2j we indicate the mean costs according to 
Equation 1571 The l\ norm gives the best results. 


projection 

h> 

e for « 52 NN 

cost 

ratio 

orthogonal 

h 

1240000 

8571752 

42.47 

adaptive 

h 

8500 

10343766 

35.20 

orthogonal 

h 

8500 

10386043 

35.05 

orthogonal 

k 

825 

12464281 

29.32 

orthogonal 

loo 

161 

39639239 

9.19 


Table 2: Mean ratio of list matching to the mean computation costs according to Equation [57J The 
values were determined over a disjunct sample of S C DB with size ,Sj = 400. The diversion from the 
m-secting line, according to Equation [34j is always <C 0.0001. 


5 Conclusion 


An adaptive projection that satisfies the 1-Lipschitz property defined by the first principal component 
was introduced. We indicated the behavior of the projections for the l p norms. The Manhattan distance 
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l\ loses the least information, followed by Euclidean distance function. Most information is lost when 
using the Chebyshev distance function. Motivated by the tree structure, we indicated a family of pro¬ 
jections that defines a mapping that satisfies the 1-Lipschitz property. It is composed of orthogonal or 
adaptive projections in the l p space. Each projection is applied recursively in a low-dimensional space, 
where “the curse of dimensionality” conjecture does not apply. 
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